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TESTING  HYPOTHESES  WHICH  ARE 
UNIONS  OF  LINEAR  SUBSPACES 


by 

Roger  L.  Berger  and  Dennis  F.  Sinlair 


ABSTRACT 

- — ^  The  likelihood  ratio  test  (LRT)  for  hypotheses  which  are  unions  of 
linear  subspaces  is  derived  for  the  normal  theory  linear  model.  A  more 
powerful  variant  of  the  LRT  is  proposed  for  the  case  in  which  the  subspaces 
are  not  all  of  the  same  dimension.  A  theorem  is  proved  which  may  be  used 
to  identify  hypotheses  which  are  unions  of  linear  subspaces.  Some  hypotheses, 
of  particular  relevance  in  ecology,  concerning  the  spacings  between  normal 
means  are  shown  to  be  unions  of  linear  subspaces  and  are  therefore  testable 

/ 

using  the  LRT.  Finally,  the  computation  of  the  LRT  statistic  is  discussed.  <^7 
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1.  Introduction. 

Character  displacement  is  an  ecological  process  by  which  coexisting 
species  diverge  in  size  to  reduce  competition  (see  Grant,  1972,  and 
Sinclair,  Mosimann,  and  Meeter,  1982,  for  details).  The  size  variable 
is  chosen  to  reflect  competition  between  species,  e.g.,  bill  length  for 
birds.  A  typical  situation  is  as  follows.  An  island  has  been  colonized 
by  J  species  within  some  family  from  the  nearby  mainland.  If  character 
displacement  is  occurring,  the  species  should  be  more  dissimilar  on  the 
island,  where  the  variety  of  resources  is  limited,  than  on  the  mainland. 
Recently,  there  has  been  much  controversy  in  the  ecological  literature 
concerning  the  existence  of  character  displacement  (Strong,  Szyska,  and 
Simberloff,  1979;  Grant  and  Abbott,  1980;  Hendrickson,  1981;  and  Strong 
and  Simberloff,  1981).  We  list  below  four  hypotheses  arising  in  character 
displacement  studies.  These  hypotheses  are  all  in  a  class  for  which  we 
derive  the  LRT.  Tests  of  these  hypotheses  may  help  resol  e  the  controversy 
surrounding  character  displacement. 

Let  the  mean  size  measurements  be  denoted  i  =  1  (island),  2  (main¬ 
land),  j  «  1,  ...,  J  (species).  Let  the  ordered  species  means  on  the 
mainland  and  the  island  be  denoted 

wi(l)  5  yi(2)  *  *  wi(J)'  1  “  2‘ 

The  four  character  displacement  hypotheses  relate  to  these  ordered  means. 
They  are  the  following. 

H0:  yl(j+l)  '  yl(j)  "  yl(J+l-j)  “  v(J-j)'  *  " 

where  [s]  denotes  the  greatest  integer  less  than  or  equal  to  s. 


Ho:  wi(j+n  '  ui(j)  “  j e  2»  •••»  J‘l» 

where  Cj,  ...,  Cjj  are  specified  positive  constants  with  c^  =  cj_j» 
j  *  2,  . ..,  J-2,  and  c^_j  »  1. 


H0:  Wl(j*l)  ‘  Ul(j)  “  W2(j+1)  ’  u2(j)'  j  "  1 . J”1* 


4  yl(3*D  ’  Wl(3)  ‘  W2(j*l)  "  y2(j)» 


Ho: 


Ul(j*l)  "  Ml(j)  *  Cj(wl(2)‘Ml(l))' 


if  . « # ,  J- 1 


2,  • • ■ j  J — 1 


where  the  Cj  are  as  in  H*. 


Hypothesis  Hq  states  that  the  island  neans  are  arranged  in  a  symmetric 
2 

fashion.  Hypothesis  HQ  specifies  the  relative  sizes  of  the  spacings 
between  the  means  in  the  symmetric  pattern.  A  commonly  claimed  mani¬ 
festation  of  character  displacement  is  equally  spaced  species  sizes  (Strong, 

Szyska,  and  Simberloff,  1979)  corresponding  to  c..  *  1,  j  »  2,  ...»  J-l, 

2  3 

in  Hq.  Hypothesis  Hq  states  that  the  corresponding  spacings  on  the  island 
and  the  mainland  are  equal  while  the  relative  sizes  of  the  spacings  are 

4 

specified  in  Hq. 

In  Section  2  we  derive  the  LRT  for  hypotheses  which  are  unions  of 
subspaces.  (Throughout  this  paper,  "subspace"  refers  to  a  linear  subspace.) 
In  Section  3  we  prove  a  theorem  which  may  be  used  to  identify  hypotheses 
with  this  property.  We  use  the  theorem  to  show  that  the  above  four  hypo¬ 
theses  are  unions  of  subspaces  and  thus  may  be  tested  with  the  LRT  of 
Section  2.  Finally,  in  Section  4  we  discuss  how  the  test  statistic  may 
be  computed. 
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2.  Likelihood  Ratio  Test. 

Let  Xj,  — ,  denote  independent  normal  random  variables  with 

2 

means  and  common  variance  a  .  We  assume  that  £  »  (£j,  . ...  CR) 

Y 

lies  in  «,  a  subspace  of  R  of  dimension  J  <  X.  For  example,  ,  . ..,  XR 
may  be  comprised  of  independent  samples  from  J  populations  (J  <  K).  We 
will  discuss  testing  hypotheses  about  £  which  are  unions  of  subspaces  of 
u).  In  this  section  we  derive  the  LRT  of 


(2.1) 


H_:  £  e  versus  H. :  £  e  w-ui. 


where  wn  *»  u  u.  and  each  u>.  is  a  q.  dimensional  subspace  of  u>. 
u  i=l  1  1  1 


We  will  show  that  the  critical  value  for  the  LRT  is  a  multiple  of  a  percentile 
from  an  F  distribution.  We  will  also  describe  a  modification  of  the  LRT 
which  has  a  higher  power  than  the  LRT  if  at  least  two  of  the  uk  have  dif¬ 
ferent  dimensions.  Throughout  we  assume  that  (2.1)  has  been  expressed  in 
such  a  way  that  the  u>.  are  all  distinct,  that  is,  there  do  not  exist  i  and 
j,  isi,  j  sm,  i  *  j,  such  that  ok  <=  tty 

Let  the  density  of  £  *  (Xj,  X^)'  be  denoted  by 


(2.2) 


P(S;  £»*)  *  (2xo2)’K/2e:cp(-  l  (x.-l- .)2/2o2) . 

i-1  1  1 


Let  Qj  ■  {(£,  a):  £  e  o  >  0),  0Q  ■  {(£,  o) :  £  e  o)Q,  o  >  0}  »  u  0j 
~  ~  i»l 

and  0  ■  {(£,  o):  £  e  w,  a  >  0).  The  LRT  statistic  for  testing  is  defined 


(2.3) 


sup  pQs;  £,<0 

90 

sup  p(j$;  £,5T  ‘ 

0 


If  we  let  l  denote  the  projection  of  £  on  u  and  denote  the  projection 
of  £  on  ok,  then  X(g)  is 


max  sup  p(x;  £,o) 
lsism  0i 

"  sup  p(x;  £,o) 

e 


(2.4) 


max 

lsism 


sup  pte;  c.o) 

Gi 

sup  p(x;  £,c) 

e 


^K/2 


>  max 
lsism 


iit-tii2 
“in  lli'ljll2 

lsism  1 


K/2 


where  ||y||  *  Y'Y-  The  third  equality  in  (2.4)  is  a  standard  result  from 

M#  W  M 

linear  models  theory.  The  last  expression  in  (2.4)  reflects  the  fact  that 

the  maximum  likelihood  estimate  of  £  under  HQ  is  the  projection  of  X  on 

^ 

the  nearest  subspace  Since  ||x-£.||  ■  ||x-S||  ♦  II S4.  ||  ,  rejecting 

HQ  if  A(g)  <  c  is  equivalent  to  rejecting  HQ  if  X*(g)  >  c*  where 


(2.5) 


X*(x) 


min  ne-ej2 

lsisra  ~ 

lla-Ill2 


The  value  of  c*  which  produces  a  size  a  test  is  given  in  Theorem  1. 


s 


Theorem  1.  Let  F_  _  u  denote  the  upper  100  a-percentile  of  an  F  distri- 

* "■ 1  " ' _  ®|ajD 

bution  with  a  and  b  degrees  of  freedom.  Let  c*  «  (  max  (J-q.)F  .  v  T)/(K-J) 

“  isism  1  a»J-VK"J 

The  test  which  rejects  HQ  if  X*(X)  >  c*  is  a  size  a  test,  that  is,  the  test 


satisfies 


(2.6) 


sup  P5>a(*#(£)  >  c*)  *  «• 


0O  - 


Proof.  For  any  £  «  a>.  and  a  >  0, 

-  ~  j 


p5i<i(»*(js)>cj)  s  p5,0(llHjll2/ll*-Ill2  >  <s> 


s  p5.,(|ihjii2/ii*-|ii2  » 

~  J 


The  last  equality  is  true  since  a  standard  result  from  linear  models  theory 
states  that  for  any  Sew.  and  a  >  0,  [(K-J)/(J-q.))  ||c-£.  ||2/||x-t||2  has 
an  F  distribution  with  J-q^  and  K-J  degrees  of  freedom.  Thus 


(2.7) 


sup  Pc  0(X-QS)  >  <£)  s  a. 


To  prove  the  reverse  inequality,  let  j  be  such  that 

(J-q.)F  .  „  ,/(K-J)  ■  max  (J-q.)F  .  „  T/(K-J)  -  c*.  We  have 

j  a,J-q.,K-J  lsian  i  a,J-q^,K-J  a 

assumed  that  uk  is  not  a  subset  of  for  any  i  *  j .  So  for  every 

i  *  1,  ...,  m,  i  *  j,  Uj  n  in.  is  a  subspace  of  dimension  at  most  q^-1. 

m 

The  set,  u  (w.  n  w. ),  cannot  contain  the  q.  dimensional  set  w.  since  each 
i-1  1  J  >  7 
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set  in  the  union  is  at  most  (q.-l)  dimensional.  Thus  there  exists  £*, 

*  J  m* 

such  that  C*  e  u.  and  C*  i  u.  for  any  i  a  1,  ....  m,  i  *  j. 

^  J  1 

Fix  a  >  0.  We  will  consider  the  sequence  of  parameter  points  (k£*,  a), 
k  «  1,  2,  ...  .  Note  that  k£*  €  w.  for  every  k  since  <a.  is  a  subspace. 

~  J  J 

Let  £?  denote  the  projection  of  £•  on  <d. •  Then  the  projection  of  k£*  on 
u>.  is  k£*  and  j|k£*-k£?||2  *  k2||£*-£?||2  -  as  k  -►  -  since  |||*-£?||2  >  0 

for  i  *  1,  . ...  m,  i  *  j. 

For  i  *  1,  ....  m,  let  *  {j:  iH-^ll2/ llx-|l|2  >  e£>.  At  (k£*.  o) 

[(K-J)/(J-q^) ]  || 5*1  j  !i 2/ Ili5"£ll 2  has  a  noncentral  F  distribution  with  J-q.^ 

2  2  2 

and  K-J  degrees  of  freedom  and  noncentrality  parameter  6^  ^  *  k  ||  / a  . 

For  i  *  1,  b,  i  *  j,  ^  ■  as  k  +  ».  So  for  i  *  1,  . ...  m,  i  *  j, 

(2.8)  Pk£*»o*Ri*  *  1  M  k  ■*“  “♦ 

On  the  other  hand  [(K-J)/(J-q^)]  j|2/|ls-|||2  has  a  central  F  distri¬ 

bution  with  J-q^  and  K-J  degrees  of  freedom.  Furthermore, 

ej  ■  Thus 

(2.9)  0(R.)  »  o  for  every  k  *  1,  2,  ...  . 

ms 

Using  (2.8)  and  (2.9)  we  obtain 


■  •  v.4j^) 


m 

2  1  -  lim  l 
k-*»  i-1 
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-  1  -  (1 -a)  -  lim  l  P  (R?) 

k-»®  i-1  »a  1 


■  a  -  0  *  a. 


Since  (k£*,  o)  «  0.  c  ©  for  every  k, 

~  j  u 

(2.10)  sup  Pr  „(X*QC)  >  c*)  s  lim  P.  .*  „(X*(X)  >  c*)  a  a. 

0Q  Z’a  °  k-K»  K£  »a  ° 

Combining  (2.7)  and  (2.10)  yields  (2.6).  || 


We  believe  that  in  most  applications  all  of  the  cik,  i  =  1,  . ...  m, 
will  have  the  same  dimension  q,  in  which  case  c  »  (J-q)F  T  „  ./(K-J). 
This  is  the  case  for  all  the  examples  we  discuss  in  Section  3.  But  if 
the  dimensions  of  some  of  the  differ,  there  is  a  modification  of  the 
LRT  which  is  also  a  size  a  test  and  has  higher  power  than  the  LRT.  This 
test  is  described  in  Theorem  2;  it  is  the  LRT  if  all  the  coi  have  the  same 
dimension. 


Theorem  2.  Let  -  [(K-J)/(J-q.)J  flj-fjl' V||x-f||2,  1  -  1,  The 

test  X**  which  rejects  HQ  if  and  only  if  >  F^  for  every 


i  ■  1,  ....  n  is  a  size  a  test.  The  test  X**  has  a  power  which  is  greater 
than  or  equal  to  the  power  of  the  LRT  for  every  (£,  o)  c  9. 


Proof.  The  proof  that  X**  is  a  size  a  test  is  almost  identical  to  the 
proof  given  in  Theorem  1  that  X*  is  a  size  a  test.  In  this  case  any 
i  *  1,  ....  m,  can  play  the  special  role  played  by  in  the  second  half 


of  Theorem  1. 
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i 


The  {j:  **(*)>  c*}  c  (j:  >  FQ  J  q  KJ,  i  ■  1,  . ..,  m>.  That  is, 

the  rejection  region  for  the  LRT  is  a  subset  of  the  rejection  region  for 
X**.  Thus  the  power  of  X#*  is  greater  than  or  equal  to  the  power  of  the 
LRT.  || 

Unless  all  of  the  quantities  (J-q.)F  T  v  ./( K-J),  i  =  1,  ...»  m, 

are  equal,  the  rejection  region  for  the  LRT  is  a  proper  subset  of  the 

rejection  region  for  X**  and  the  power  for  the  LRT  is  strictly  smaller 

than  the  power  of  X**  for  every  parameter  in  .  This  provides  an  example, 

like  that  of  C.  Stein  (see  Bickel  and  Doksum,  1977,  p.  239),  of  a  LRT 

whose  power  is  everywhere  dominated  by  the  power  of  another  test. 

The  test  which  rejects  H..:  Sew.  if  F.  >  F  .  ,  .  is  a  size  a 

J  Oi  Z  i  l  a.J-q^K-J 

test  of  Hq..  The  test  of  HQ  in  Theorem  2  rejects  HQ  if  and  only  if  for 

each  i  the  test  based  on  Fi  rejects  B0i-  Tests  of  this  form  have  been 
called  intersection-union  tests  by  Gleser  (1973);  they  have  also  been 
discussed  by  Berger  (1982). 


3.  Hypotheses  Consisting  of  Linear  Subspaces. 

In  the  remaining  sections,  we  discuss  some  specific  problems  which 
fall  into  the  general  framework  described  in  Section  2.  These  problems 
involve  hypotheses  about  the  spacings  between  normal  means.  The  ecological 
problem  which  motivated  our  interest  in  these  hypotheses  was  described 
in  Section  1. 


We  will  consider  the  following  special  case  of  the  model  presented 
in  Section  2. 
denote  K  *  £ 

i.j 


Let  Xijk’  *  *  r»  3  *  x»  •••»  Ji»  k  ■  •••» 

independent  normal  observations.  The  mean  of  is 
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y^  and  all  the  have  a  common  variance  of  a  .  Let 
M  B  (Wu»  •••»  M21,  ...»  md 

£  3  ^lliijj*  *■’  UU1~K1J  *  W21~K21’  ***#  y  wftere  4  1S  a 

vector  of  b  ones .  In  the  formulation  of  Section  2  we  would  consider 
£  e  <u  c  R  .  Since  here  there  is  a  one  to  one  correspondence  between  y 

M 

and  C,  we  may  equivalently  consider  P  £  ^  where  J  ■  \  J. »  We  now  consider 

^  i  * 

the  subspaces  ok,  i  *  1,  m,  as  subspaces  in  R^. 

We  will  be  concerned  with  permutations  of  (y^j,  . ..,  y^  )  for  each 
i.  a  1|  •  •  •»  I#  Lot  8  (y^f  •••#  )  so  tbftt  y  a  (jUj»  •  •  •  >  Vj)  • 

A  map  ir:  is  called  a  subpermutation  if  ir(y)  =  (wjCy^),  ...» 

where  ir. (y.)  is  a  permutation  of  y.,  i  *  1,  ...»  I.  There  are  n  J.l  sub- 

1  ~1  -1  i=i  1 

permutations.  A  set  A  c  C  is  called  subpermutation  invariant  if  y  e  A 
implies  w(y)  e  A  for  every  subpermutation  a.  A  set  B  is  called  a  subpermu- 
tation  of  a  set  A  if  B  is  the  image  of  A  under  some  subpermutation  ir.  It 
is  easily  verified  that  if  A  is  a  subspace  then  any  subpermutation  of  A 
is  also  a  subspace. 

Ihe  following  theorem  may  be  used  to  identify  hypotheses  which  are 
unions  of  subspaces.  We  shall  use  this  theorem  to  show  that  the  ordered 
mean  hypotheses  in  Section  1  are  unions  of  subspaces  and  hence  are  testable 
using  the  LRT  of  Section  2. 


Theorem  3.  Let  be  a  subpermutation  invariant  subset  of  R^.  Let  H 
denote  a  (J-q)  *  J  matrix  of  rank  J-q.  Let  N  •  {y  £  R*1 :  Hy  ■  Oh  Let 

0  -  (y  c  RJ:  yu  s  yi2  s  ...  S  y^,  i  ■  1,  ....  I).  If  N  n  0  -  wQ  n  0 

m  I 

and  N  c  w  then  wn  ■  u  w.  where  m  ■  n  J. I ,  w.  is  the  q  dimensional  sub- 

0  0  i-1  1  i-1  1  1 

space  N,  and  t»2,  ...,  u>n  are  the  subpermutations  of  N.  Thus  u)Q  is  the 
unio-*  of  q  dimensional  subspaces. 
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Proof.  Suppose  y  c  u  w. .  Then  y  =  ir(y*)  for  some  y*  e  N  and  some  sub- 

^  •  «  X  ^  ^  ** 

1=1 

permutation  tt.  Since  N  c  and  is  subpermutation  invariant, 

m 

y  =  ir(y*)  e  w-.  Thus  u  u.  c  w  . 

~  0  i=l  1  0 

Now  suppose  y  e  Uq.  Let  y*  be  a  subpermutation  of  y  such  that 
y*  £  ...  £  y?T  ,  i  =  1,  ....  I.  By  definition,  y*  c  0.  Also,  y*  e  w_ 

11  lvK  ~  ~  u 

since  is  subpermutation  invariant.  Thus,  y*  e  o>_  n  0  =  N  n  0  c  n. 
u  ~  u 

n 

Since  y  is  a  subpermutation  of  y*  «  N.  M  u.  for  some  i.  Thus  u-  c  u  u. . 

^  1  U  .  •  X 

i=l 

Finally,  since  N  =  is  a  q  dimensional  subspace  of  V? ,  each  of 
the  subpermutations  of  N,  u>2,  ...,  is  also  a  q  dimensional  subspace 
of  RJ.  1 1 


The  subspaces  cu, ,  . ..,  a>  defined  in  Theorem  5  will  not  be  distinct, 
l  m 

As  will  be  seen  in  the  following  examples,  the  number  of  distinct  sub¬ 
spaces  will  be  much  smaller  than  m  (at  most  m/2,  in  fact) .  Recognizing 
this  fact  results  in  a  saving  of  effort  in  the  computation  of  the  test 
statistic  X*  for  which  the  minimum  needs  to  be  taken  only  ovor  distinct  sub¬ 
spaces.  Taking  the  minimum  over  all  m  subspaces  in  Theorem  3  will,  of 
course,  give  the  some  value  of  X*.  It  would  just  be  inefficient  since 
many  of  the  subspaces  are  equal. 

We  now  consider  the  four  hypotheses  about  the  spacings  between  normal 
means  discussed  in  Section  1. 


Example  1.  (Symmetric  Spacings).  For  this  example,  I  =  1,  so  we  will 
denote  the  means  by  y^,  ...,  Vj  and  the  ordered  means  by  y^  s  ...  s  y^j. 
By  the  "symmetric  spacings  hypothesis"  we  mean 
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(3.1)  hJ: 


“(;♦!)  *  w(j) 


w(J*l-j)"w(J-jr 


1. 


C(J-l)/2]. 


We  shall  use  Theorea  3  to  verify  that  is  the  union  of  subspaces.  Let 
H  be  a  [(J-l)/2]  x  j  aatrix  such  that  Hji  ■  0  is  equivalent  to  the  conditions 
Uj+1  -  *  wj*l-j  ~  J  *  I*  [(J-D/21-  If  U  «  0  then  y..  »  y(^. 

j  *  1,  ....  J,  whence  Hy  =  0  is  equivalent  to  y^+lj  -  y^j  = 
j  *  1,  [(J*l)/2],  That  is,  N  n  o  c  ^  n  o  as  required  by  Theorea  3. 

How,  to  verify  that  N  c  Uq.  Let  y  c  N,  that  is,  Hji  =  ft.  Let  y*  =  (Vj+yj)/!. 
For  any  j  *  1,  ...,  [(J-l)/2],  y^  -  y*  =  Uj  -  y*  ♦  jj’J  (y.^-Uj)  =  V*  -  Vj 

*  li.l  (WJ*l-i'UJ-i)  *  V*  "  yJ+l-j*  *>•  eaGh  P*1*’  Fj  ***  is  sy®- 

setrically  placed  about  y*.  (If  J  is  odd,  f(J-l)/2]  ♦  1  *  J  -  [ (J-1J/2]  and 

Wl(J-l)/2]*l  *  vJ-((J-l)/2]  *  If  vt  *  w(j)  vs  *  v(j+l)  1,1611 

yJ*l-r  *  M(J*l-j)  *“d  MJ+l-s  *  w(J-j)*  Thus  w(j*l)  “  w(j)  *  vs  ‘  p*  “  (vr";,4) 

"  *»•  *  »*J*l-s  *  iv*’VJ+l-T*  *  lhe«foT«»  V  €  -0*  1,61,66 

Nc^is  required  by  Theorea  3.  By  Theorea  3,  the  symmtric  spacings 

hypothesis  is  the  imion  of  subspaces  of  dimension  J  -  [(J-l)/2]  and  A*  can 

be  used  to  te.:fc  hJ. 

This  exaaples  gives  a  good  illustration  of  the  fact  that  the  a  subspace:, 
defined  in  Theorea  3  are  not  all  distinct.  Let  J  *  4.  In  this  exaaple, 

■  s  41  «  24  but  actually  consists  of  only  3  distinct  subspaces.  The  H 
defined  in  the  previous  paragraph  can  be  written  as  H  *  (1,  -1,  -1,  1). 

These  eight  permutations  of  N  *  (g:  Hji  *  0}  all  equal  N  itself:  (1,  3,  •♦)» 

(1.  3,  2,  4),  (4,  2,  3,  1),  (4,  3,  2.  1),  (2,  1,  4,  3),  (2,  4,  1,  3), 

(3,  1,  4,  2),  (3,  4,  1,  2).  These  eight  permutations  of  N  all  yield  the 
subspace  defined  by  (1,  -1,  1,  -l)y  »  0:  (1,  2,  4,  3),  (1,  3,  4,  2), 

(4,  2,  1,  3),  (4,  3,  1,  2),  (2,  1,  3.  4),  (2,  4,  3,  1),  (3,  1,  2,  4), 


(3,  4,  2,  1).  These  eight  permutations  of  N  all  yield  the  subspace  defined 

by  (1,  1,  -1,  -l)u  *  0:  (1,  4,  2,  3),  (1,  4,  3,  2),  (4,  1,  2,  3),  (4,  1,  3,  2), 

(2,  3,  1,  4),  (2,  3,  4,  1),  (3,  2,  1,  4),  (3,  2,  4,  1). 

Example  2.  (Symmetric  Spacings  with  Specified  Ratios) .  Again  I  *  1  so 
the  notation  of  Example  1  is  used.  Now  the  hypothesis  of  interest  is  a 
subhypothesis  of  hJ,  namely 

2 

(3.2)  Hq:  *  w(j)  a  Cj^w(2)"y(l)^’  3  *  2*  •••»  J”1 

where  c2,  . ..,  Cj^  are  specified  positive  constants  with  c^  *  Cj  . , 

j  *  2,  . ..,  J-2,  and  Cj^  ■  1.  The  restrictions  on  the  c^  imply  that  the 

12 

symmetric  spacings  are  equal  as  in  HQ.  HQ  can  be  used  to  test  whether  the 
means  are  spaced  like  the  expected  values  of  order  statistics  from  some 
symmetric  distribution.  For  example,  if  c^  »  1,  j  »  2,  ...,  J-l,  the  distri¬ 
bution  is  the  uniform.  If  J  •  5,  Cj  *  Cj  *  .74111  and  c^  »  1,  the  distri- 

2 

bution  is  the  normal.  Theorem  3  can  be  used  to  verify  that  HQ,  which  is 
subpermutation  invariant,  is  the  union  of  subspaces.  Let  H  be  a  (J-2)  x  J 
matrix  such  that  Hu  »  0  is  equivalent  to  the  conditions 
j  *  2,  ...,  J-l.  Arguing  as  in  Example  1  it  is  easy  to  verify  that 
N  n  0  ■  u»0  n  0.  To  verify  that  N  c  u>0,  let  u  e  N,  that  is,  Hu  *  0.  If 
Ui  s  u2»  then  Vj  s  U2  *  •••  *  Uj  since  c^  >  0,  j  *  1,  ....  J-l.  Thus 
jitNnO*iiijj(iOc  Uq.  If  uj  u2>  ^en  Uj  *  u2  2  •  •  •  *  Uj*  For  cr>y 
j  *  2,  ....  J-l,  M(j+1)  -  u(j)  "  yj.j  -  uJ+1_j  *  ‘Cj.jCwrWj)  =  cj. j fwj.j-Pj) 

=  cj  (uj.i'Mj)  *  cj  ^W(2)*W(1)^  *  ^'e  secon<*»  third  and  fourth  equalities  an.- 

true  since  Hu  ■  0,  Cj_j  ■  1,  and  c^  ■  Cj_j.  Thus  £  e  By  Theorem  3, 

Uq  is  the  union  of  2-dimensional  subspaccs  and  \*  can  be  used  to  test  H^. 


Example  3.  (Equal  Spacings  in  TWo  Sets  of  Means) .  In  this  example  there 
are  two  sets  of  means  of  interest,  1=2  and  Jj  =  J2  *  J'  *  J/2.  We  are 
interested  in  testing  the  hypothesis  that  the  spacings  in  the  first  set, 

Pjj,  Vjj*.  are  ©qual  to  the  spacings  in  the  second  set,  y21,  . ..,  w2j" 
That  is,  we  wish  to  test 

(3.3)  Hq:  *  w2(j4-l)  ~  w2(j) *  1  m  l>  •••»  J'-!* 

3 

To  verify  that  HQ,  which  is  subpenmitation  invariant,  is  the  union  of  sub¬ 
spaces,  let  H  be  a  (J'-l)  x  J  matrix  such  that  Hy  =  0  is  equivalent  to  the 
conditions  “  Ujj  *  U2  j+1  ”  y2j*  3  *  1»  •••»  J'-l.  As  in  Example  1, 

it  is  easily  verified  that  N  n  0  =  wA  n  0.  To  verify  that  N  c  w  let  y  e  N. 

For  any  j  .  1,  ....  J',  *  V21  =  *  !,21 

=  y^  +  Cvi2i~v»n) -  Thus  the  set  of  means,  y21,  ...,  w2j-»  is  a  translation 
of  the  set  of  means,  y^,  ...,  y^*,  the  amount  of  translation  being  (u21-y;,). 
Thus  the  spacings  among  u21,  V2j-,  are  all  equal  to  the  spacings  among 

•••»  Vjj*  and  £  e  u>Q.  By  Theorem  3,  u>q  is  the  union  of 
subspaces  of  dimension  J  -  J'  ♦  1  «  J/2  ♦  1  and  \*  can  be  used  to  test 

This  argument  can  be  easily  extended  to  the  situation  in  which  one 
wishes  to  test  for  equal  spacings  in  I  (I  >  2)  sets  of  means.  In  this  cs^o 
H  is  an  (I-1)(J'-1)  x  j  matrix  and  the  subspaces  are  of  dimension 
J  -  (I-l)GT-l)  ■  I  ♦  J'  •  1  ■  I  ♦  J/I  -  1. 


Example  4.  (Equal  Spacings  in  Two  Sets  of  Means  with  Specified  Ratios). 
For  this  example,  the  notation  is  the  same  as  in  Example  3.  We  combine 
the  ideas  in  Examples  2  and  3  to  consider  testing 


(3.4)  hJ: 


Ml(3’l)  "  Wl(3)  "  V2(j+1)  ’  W2(j) * 


3=1,  ...» 


J'-i 
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where  the  c.  satisfy  the  same  conditions  as  in  Example  2.  This  null 
hypothesis  is  also  subpernutation  invariant.  If  we  let  denote  the 
subspaces  in  Example  2  (now  considered  as  subspaces  of  R*7  with 
y2^,  •••»  u2j-  unrestricted)  and  let  denote  the  subspaces  in  Example  3, 

4 

we  see  that  for  H0,  can  =  (u  u  )  n  (u  n  )  a  u  (w  n  n  )  •  But  for  every  r 

r  r  s  5  r,s  r  s 

and  s,  u  n  n  is  a  linear  subspace  of  dimension  3.  (The  subspace  u 

X  S  T1 

involves  J'-2  restrictions  and  n  involves  an  additional  J'-l  restrictions. 

s 

This  results  in  u>r  n  having  dimension  3.)  Thus  A*  can  be  used  to  test 


4.  Computation  of  the  Test  Statistic. 

As  in  standard  linear  models  theory,  the  LRT  statistic  A*  can  be 
expressed  as  a  product  of  matrices.  This  will  simplify  the  computation 
of  A*.  Furthermore,  for  the  hypotheses  described  in  Section  3,  A*  can  be 
computed  without  any  minimization.  These  points  will  be  discussed  in  this 
section. 

Let  the  J  dimensional  subspace  u>  be  defined  as  ui  *  {£:  £  =  W$,  6  e 
where  W  is  a  known  KxJ  design  matrix.  Let  the  subspaces  be  defined  by 
“i  °  (C;  £  »  W8,  Hjjg  «  0}  where  fT  is  a  known  (J-qj)*J  matrix  of  rank 
J-q^,  i  *  1,  ...,  m.  Let  B  ■  (W'W)**W'X.  Then  analogous  to  standard 
linear  models  theory  we  can  write 

SSH0 

(4.1)  **(£)  ■ 

where 


(4.2) 


SSHq  »  min  |'H'(Hi(W'W)"1H:)'JHi| 


IS 

and 

(4.3)  SSR  ■  (X-W8) '(X-Wg) . 


Expression  (4.2)  is  true  since  the  numerator  of  X*  in  (2.5)  is  the  minimum 
of  the  sums  of  squares  associated  with  each  of  the  hypotheses  H^:  £  e  oj^. 
By  standard  linear  models  theory  these  sums  of  squares  are  the  expressions 
given  in  (4.2). 

For  the  examples  considered  in  Section  3,  the  minimum  in  (4.2)  does 


not  need  to  be  computed.  It  is  possible  to  determine  which  sum  of  squares 
will  be  the  minimum  by  examining  the  order  of  the  sample  means,  as  is  shown 
below.  Thus  only  one  sum  of  squares  needs  to  be  computed  for  these  models. 
For  the  remainder  of  this  section,  assume  the  model  defined  in  Section  Z 


For  this  model  g  ■  X  ■  (Xj,  ...,  X') '  where  X.  *  (X^,  ...,  X.^  )'  and 
^ij  “  t-k»l  Xijl/Kij*  Let  t*ie  matr^x  f*0®  Theorem  3,  be  partitioned  as 
H  *  (Cj  :  ...  :  Cj)  where  is  a  (J-q)xJ^  matrix  with  columns  c^,  ...,  c^j 
i  »  1,  ...»  I.  For  i  ■  1,  ...,  I,  suppose  w.(l),  ...,  itj(J.)  is  a  peTmu- 


tation  of  1, 
matrix  H*  by 


J.  such  that  X.  ...  s  ...  s  X..  M  .. 


Define  a  (J-q)xJ 


(4.4)  H*  -  (CJ  :  ...  :  CJ) 

where  C?  is  a  (J-q)xj.  matrix  with  columns  cj.,  ...,  c*  and  c?.  *  c  . 
i  x  ~xj 

Then  the  numerator  of  X*,  SSHp,  is  given  in  Theorem  4. 

Theorem  4.  If  H,  N,  0,  and  Mq  satisfy  the  conditions  of  Theorem  3  and  H* 
is  defined  by  (4.4)  then 

(4.5)  SSHq  ■  g 'H* ' (H* (VT W) " *H* ' ) " XH*X . 


1 
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Thus  the  Matrix  H*  can  be  constructed  by  pe routing  the  coluans  of  H 
in  a  way  dictated  by  the  order  of  the  sample  Means.  Then  SSHq  can  be 
computed  directly  because  H*  is  the  particular  FL  which  minimizes  the  sum 
of  squares  in  (4.2).  The  proof  of  Theorem  4  uses  the  following  two  lemmas. 


1.  Suppose  H,  N,  0  and  u  satisfy  the  conditions  of  Theorem  3  and 


H* 


is  defined  by  (4.4).  Let  N*  a  (ji  £  H*y  **  0).  Let 

«w  ^ 


o*  •  {„  .  IT1:  S  ... 

and  N*  n  0*  »  u0  n  0*. 


s  n  'i*  1  *  l»  •••»  x>*  ^en  N*  c  wn 


Proof.  For  any  y  e  If1,  let  *(u)  »  (<,((■,),  ....  *.(y.))'  where 

**  X  1  <wl 


»iC*ii)  *  •••»  v*~n  ^  £44  *  (c44i«  ••*»  ci 


iw(l)*  “iw^r 


*ij  ^ijl* 


ij(J-q) 


)'.  Then 


for  any  w  c  and  1  s  r  s  J-q,  the  r**1  coordinate  of  H*(y)  is 

*w  m* 


*1=1  h-l  cljr“lMJ)  “d  rth 
J.  1  ,  J. 


coordinate  of  H*y  is 


^i»l  ^j-1  Cijrwij  *  ^i-1  £j-l  Ciw-l(j)rWij  *  5»I  cijrwi*(j)‘  11,115 
Hw(y)  »  H*ii  for  every  y  c  If1. 

M  A#  #W 

If  y  €  N*,  g  «  H*y  =  H*(y).  Thus  w(y)  c  N  c  »Q.  Since  *»0  is  sub- 
permutation  invariant,  y  e  «q.  Thus  N*  c  and  hence  H*  n  O1  c  ^  n  01. 
If  y  e  w.  n  0V,  then  *(y)  e  0.  Furthermore  *(y)  c  w.  since  u.  is 

•W  U  M  U 

subperautation  invariant.  Thus  w(y)  e  N.  So  0  ■  H*(y)  »  H*ji  and,  hence.. 
ji  e  N*.  Thus  y  e  M*  n  0*,  and  hence  *»Q  n  0*  c  N*  n  0*.  1 1 

Lemma  2.  Suppose  ri^  s  ...  s  X4_/T  >  tor  i  *  1,  ...,  I.  Then  for  any 


e*\ 


HwjU)  *  *  •  * 


(4.6) 


I  Ji 


L  _ 

i-1  j«l 


I  ^i 


where  is  a  s-r, permutation  of  y  satisfying 

..  4  .  *  T 

t.  •  A  **  A  «  •  ■  •  I  Z  t 


"‘V-V 
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Proof.  Let  g(X^,  jp.)  ■  -  J  (Xjj-W^j)  ■  G(£.-p.).  It  is  easily  shown 

(Marshall  and  Olkin,  1979,  p.  57)  that  G  is  Schur- concave.  It  follows  by 
Lemma  2.2  of  Hollander,  Pros chan,  and  Sethuraman  (1977)  that  g  is  decreasing 
in  transposition  and,  hence. 


J.  J. 

X  .2 


"  X.  Kij^ij"yij)  *  ~  X.  K**^*"w*i)  • 


j=l 


j*l 


i  j  i  j  ijJ 


Inequality  (4.6)  follows  since  the  above  inequality  holds  for  each 
i  *  1.  ....  I.  11 


Proof  of  Theorem  4.  Let  N*  and  o"  be  as  in  Lemma  1  and  let 

-  j— 

g(£»  p)  *  l  l  K,.(X. ,-y. .)  .  The  numerator  of  X*QC)  is 
i»l  j=l  13 


SSHq  -  inf  g(X,  y) 

P«“0 


s  inf  g(X,  p)  (since  N*  c  u»n  by  Lemma  1) 
pcN*  ~  0 

(4.7) 

s  inf  _  g(X,  p)  (since  un  n  0*  c  N*  by  Lemma  1) 

y€wnn 0 
**  u 


■  inf  g(X,  p), 
y6W0 


the  last  equality  being  true  by  Lemma  2  since  Uq  is  subpermutation  invariert . 
The  second  and  last  expression  in  (4.7)  are  equal  so  the  inequalities  are 
equalities.  But  by  standard  linear  models  theory,  the  expression  involving 
N*  in  (1.7)  is  the  right  hand  side  of  (4.5).  [| 
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We  will  now  compute  a  small  example  using  Theorem  4.  Suppose  we  wish 
to  test  the  symmetric  spacings  hypothesis  of  Example  1  with  J  »  3  popu¬ 
lations.  So  H  *  (1,  -2,  1)  can  be  used.  Suppose  n  *  4  with  two  observations 
on  population  2,  so 


fl  0  ol 


W  * 


0 

0 


1 

1 


0 

0 


10  o  1J 


Suppose  the  observed  mean  vector  is  |  »  (1,  10,  4)'.  Then  H*  =  (1,  1,  -2). 
Equation  (4.S)  yields  SSHq  *  18/11.  The  maximum  likelihood  estimate  of  jj 
obtained  as  the  projection  onto  N*  is  £  »  (5/11,  107/11,  56/11).  The  sums 
of  squares  corresponding  to  the  projections  onto  the  other  two  subspaces 
in  u>0,  those  defined  by  Hy  »  0  and  (-2,  1,  l)y  *  0,  are  225/4  and  288/11. 
Clearly  SSHq  is  the  minimum  of  the  three  sums  of  squares. 
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The  likelihood  ratio  test  (LRT)  for  hypotheses  which  arc  unions  of  linear 
subspaces  is  derived  for  the  normal  theory  linear  model.  A  more  powerful  variant 
or  the  LRT  is  proposed  for  the  case  in  which  the  subspaces  are  not  all  of  the 
same  dimension.  A  theorem  is  proved  which  may  be  used  to  identify  hypotheses 
which  are  unions  of  linear  subspaces.  Some  hypotheses,  of  particular  relevance 
in  ecology,  concerning  the  spacings  between  normal  means  are  shown  to  be  unions 
ox  linear  subspaces  and  are  therefore  testable  using  the  LRT.  Finally,  the 
computation  of  the  LRT  statistic  is  discussed. 


